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This paper summarizes the application of two adaptive approaches to autopilot design, 
and presents an evaluation and comparison of the two approaches in simulation for an 
unmanned aerial vehicle. One approach employs two-stage dynamic inversion and the other 
employs feedback dynamic inversions based on a command augmentation system. Both are 
augmented with neural network based adaptive elements. The approaches permit adaptation 
to both parametric uncertainty and unmodeled dynamics, and incorporate a method that 
permits adaptation during periods of control saturation. Simulation results for an FQM- 
117B radio controlled miniature aerial vehicle are presented to illustrate the performance of 
the neural network based adaptation. 
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Nomenclature 

normal acceleration ( a n = - a . ) 

acceleration along the body x, y, and z-axis, respectively 
moment of inertia in x, y, and z-axis, respectively 
rotational rate along the body x, y, and z-axis, respectively 
elevator, aileron, and rudder control deflection, respectively 
modeling error 
pseudo-control 


subscripts 

c = command parameter 

s = stability axis 

h = hedging 


I. Introduction 

R ecent technology developments allow unmanned aerial vehicles (UAVs) to displace manned aircraft in many 
commercial and military roles. As these roles are expanded from simple reconnaissance missions to more 
complex missions, there is an increasing need for control systems that are robust to model uncertainty due to 
incomplete modeling, malfunction, or damage during operation. A challenge to designers of flight control systems is 
to achieve highly maneuverable UAVs without requiring accurate modeling of these vehicles. Adaptive flight 
control designs provide a way to deal with the uncertainties in the system and environment, without sacrificing 
performance. 

Most UAV developments are based on simple and cheap systems with minimal mass, having minimal or no 
aerodynamic data for control design. Therefore, control design for UAVs should take these uncertainties into 
account. UAV dynamics are also significantly affected by their payloads, which can vary depending upon their 
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mission. Therefore, it is highly desirable to employ an approach to flight control design that is low cost and does not 
require extensive timing of gain tables. Adaptive approaches to control system design are ideally suited for this 
application. 

This paper will illustrate the use of neural network (NN) based adaptive control designs for a UAV. The main 
objective of controller design is to demonstrate adaptation to model uncertainties such as unknown or inaccurate 
mass properties and unknown aerodynamic derivatives, as well as external aerodynamic disturbances such as wind 
gust that can significantly impact UAV flight performance at low speeds. 

This paper presents two NN -based adaptive flight controls that have been successfully utilized for a variety of 
aerospace applications 1 ' 2 ' 3 , incorporating recent advances in the area of state/output feedback and adaptation under 
saturated control conditions. One approach is based on a two-stage dynamic inversion with approximate feedback 
linearization and synthesis of a fixed-gain linear compensator, and the other approach is a command augmentation 
system based dynamic inversion control, while both incorporating NNs as adaptive elements to compensate for the 
modeling errors, unmodeled dynamic characteristics of the plant 4 ". The effects of control saturation are also directly 
accounted for in the design of the adaptive controller through pseudo-control hedging (PCH) 6 . 

The UAV (FQM-117B) used for this study is described in Section II. A two-stage dynamic inversion based 
adaptive control design follows in Section III. A command augmentation system based adaptive control design is 
presented in Section IV. PCH to handle control input nonlinearities is described in Section V, and NNs are briefly 
discussed in Section VI. Simulation results are presented in Section VII. Conclusions are given in Section IX. 

II. The UAV, FQM-117B 

The UAV used for this research is the FQM-117B radio controlled miniature aerial vehicle shown in Fig. I 7 ' 8 , 
which is roughly a 1/9 scale version of Russian fighter aircraft MIG-27. This UAV is composed entirely of 
injection-molded Styrofoam, and has a 1.70 m wingspan, 1.88 m length, and a total vehicle weight of approximately 
6.72 kg. It is powered by a 0.60 cubic inch, 1.9 HP glow fuel engine and has elevator, ruder and full span ailerons. 
Its moments of inertia are approximately I xx = 0.2622 , I n , = 1.2628 , I zz = 1.5361 , I xz = -0.0708 , and 

r „ =[ vz = 0 kg-m 2 . 




Figure 1. FQM-117B UAV 
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For the control design of the UAV, only simple static wind tunnel test data were available along with its mass 
properties. The vehicle employs inexpensive instrumentation, that are noisy and possess a significant amount of 
bias, drift and scale factor error. In addition, it is clear that its mass property changes significantly as fuel is 
consumed, and its flight envelope includes low altitude and low speeds where air disturbances such as gust are 
common. 

III. Control Design 1: Two-Stage Dynamic Inversion Based Adaptive Control Design 

In this design approach, angle of attack (or), sideslip angle (p) and stability axis roll rate (p s ) are commanded. As 
shown in Fig. 2, the pilot’s command is inputted to the command filters to generate reference signals, while 
employing pseudo-control hedging (PCH) to protect the adaptive process from effects due to control saturation. 
Next proportional and derivative (PD) controllers are used to follow the reference commands. The control 
commands are obtained by a two-stage dynamic inversion. Since there is no a and P sensor, the required feedbacks 
are assumed to be computed by integration of IMU sensor outputs. The PCH and the NN signals shown in the figure 
are further discussed in Section V and VI. 



Figure 2. Adaptive feedback control architecture. 


A. Two-stage Dynamic Inversion 

A two-stage approach for dynamic inversion has been developed for designing a flight control system that 
regulates \p s a P | 2 ' 9 ' 1CU1 . The structure of the inverting law and its implementation is displayed in Fig. 3, where the 
states for the stage 1 dynamics are ij =[a a P P (f> 0 V] T and those for stage 2 dynamics are 
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Figure 3. Two-stage dynamic inversion control law structure. 
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The control variables for the stage 1 dynamics are the angular accelerations in the roll, pitch and yaw stability axis 
frame, u\=\p sc q sc r sc ] 7 and the control variables for the stage 2 dynamics are the effective control 

displacement commands in each axis, u 2 = [S ac 8 ec 8 rc ] 7 . The regulated variables in each stage are, y, = \p s . a. / 

andy 2 = [p S i<h r s Y ■ Note that the regulated variables of the stage 1 dynamics are related to regulated variables [p s 
a P ] according to the relative degree of each regulated variable. 

Subject to a set of approximations 14 the stage 1 dynamics can be expressed in the following form 
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where p s and r s denote the stability axis roll and yaw rates. Similarly, the stage 2 dynamics can be expressed as 
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B. Computation of the control 

Consider the stage 2 dynamic equation expressed as 


Then it follows that 


x 2 = A(x) + B(x)-u 2 
T2 = *2 


j >2 = x 2 = A(x) + B(x) ■ U 2 
= A(x) + B(x) ■ u e = u | 


The stage 1 dynamic equation is given as 


y\ = F{x) + G(x ) • U\ = v 


(2) 


(3) 

(4) 

(5) 


where v is the pseudo control. Combining Eq. (4) and Eq. (5), we have the commanded control that is applied to the 
aircraft. 


u = (G(x)fi(x)) l [v - {F(. x) + G(x)A(x)}] 


(6) 


where G(x ) , B(x ) , F(x) and A(x) denote estimates of G(x), B(x), F(x) and A(x). 

C. Control Architecture 

The pseudo-control for this state feedback control design has the form 

v = x { J ] + v dc -v ad (7) 

where x\! ■ is output of an r^'-order reference model that is used to define the desired closed loop response, v dc is 
the output of a dynamic compensator, and v ad is the adaptive signal. The error dynamics for the state feedback can 
be expressed as 
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( 8 ) 



-*« 


= ~Vdc + V ad ~ A 


It is apparent that the dynamic compensator should be designed to stabilize Eq. (8), and that the role of vv is to 
cancel A. 


IV. Control Design 2: Command Augmentation Based Adaptive Control Design 

As shown in Fig. 4, the acceleration commands to the UAV are first converted to P, Q and R commands (P c , Q c , 
R c ) through an outer-loop controller, while ensuring the vehicle’s stability and maintaining trimmed sideslip angle 
during maneuvers 12 . Then first order reference models are inserted in each channel to generate reference signals, 
while employing PCH to protect the adaptive process from effects due to control saturation. Next proportional 
controllers are used to follow the reference commands P rm , Q rm and R rm . The output of the controller is a part of the 
total pseudo control v, which is the desired angular acceleration. The equations for angular acceleration are inverted 
to obtain the effective control in each axis. Figure 4 also shows the NN and the PCH signals, which are further 
discussed in Section V and VI. 


Pilot's 



Figure 4. Command augmentation system with NN. 


A. Outer-Loop Controller 

The outer-loop controller produces a pitch rate command Q c and a yaw rate command R, such that the lateral 
acceleration remains close to zero, which provides turn coordination. The dynamic compensator has a proportional 
plus integral (PI) form: 


Qc= K \\ a nc ~ a n) + K 2 ' |q ( a nc ~ a n ) dT 

R c =~K 3 ■ a y -K 4 -j; a y dr 


(9) 


where the feedback gains K t , K 2 , K 3 , and K 4 can be selected based on speed of response. 


B. Command Filter (Reference Model) 

A first order reference model is introduced to generate reference signals in each channel. For instance, for roll 
channel it is 



1 

r-s + l 


(10) 


where r is the desired roll mode time constant. In this process, pseudo-control hedging is incorporated to handle 
control nonlinearities. 
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C. Dynamic Compensator and Control 

The derivatives of the body angular angles can be described as, designating the approximate mathematical 
models P, Q, R with the pseudo controls, v p , v „ , v r : 
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Inverting Eq. (11) resulting in the control laws: 
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( 12 ) 


The controls in Eq. (12) are based on the simplified functions F,{x), i =1,2,3, which retain only a few dominate 
terms: 
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(13) 


(14) 


Here the aerodynamic coefficients {C[ p ,C ma , etc) and control effectiveness ( Cig cl ,C m § e ,C n g r ) are set to constant 
values within reasonable ranges. These approximations introduce modeling error. 


The exact expression for P can be, for instance, written by 


where 


P = Cy 



Q=Q 0 + c //r/? + Q 
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(15) 

(16) 


Because only a few dominant terms among those in Eq.(16) are retained in Eq.(ll) and (13), there always exists a 
modeling error A p defined by 

A p =P-P=P-v p (17) 


Modeling errors A (/ and A ; . in pitch and yaw channels, respectively, can also be defined by the same way. Using 
these definitions, the time derivatives of angular rates can be described by: 
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P - P + A p — v p + A p 

Q = Q + A q = v q + A q (18) 

R = R + A } . = v r + A r 

The equations in Eq.(ll) can be transformed to a linear, time invariant form by designating the pseudo-controls 
including only proportional control laws: 

v p = A V (Pc ~ P) ~ v adp 

v q = A 2 ' (Qc ~ Q)— v adq (! 9 ) 

v r ~ A 3 ' (Rc ~ P) ~ V adr 

where v a( j p , v adq , v adr are adaptive signals which are the output of neural networks as shown in Fig. 4. Substitution 
Eq. (19) into Eq. (18) gives 


P - a y(P c ~P)~ v adp + ^ P 

Q=A 2 \Q c -Q)-v adq +Aq (20) 

p = A y (R c - R) - v adr +A r 


Hence, if the NN adaptive signals ( v adp ,v adq ,v adr ) cancel out the modeling errors ( A p , A , A r ), then asymptotic 

tracking in body angular rates can be expected. Consequently, the neural networks play the key role of generating 
the adaptive signals to compensate for the modeling errors due to the use of approximate models, uncertainties in 
each channel. Feedback gains A\,A 2 ,At > are chosen to satisfy the desired handling qualities. 


D. Output Feedback Design 

Most of UAV implement simple, fundamental measuring instruments, so only a set of limited parameters is 
usually available for feedback. Thus, an output feedback design of the control system should be considered for such 
cases. 


As a further simplification, one can ignore angle of attack (a) and sideslip angle (/:!) dependencies, and then the 
terms including these angle values in Eq.(13) could be omitted, and in addition the x-axis speed component, U may 
be replaced by the total speed V as shown below: 


Sqb 

F \ (x) = —C lp ■ 



F 2 (x) = 



C„ 



F i(x) 


A. U V) 


(21) 


According to the theoretical background on the adaptive output feedback design presented in Ref. 5, the delayed 
signals in each channel are input to the NN adaptive elements (see Section VI). 

V. Pseudo-Control Hedging (PCH) 

PCH is used to address NN adaptation difficulties arising from various actuation nonlinearities, including 
actuator position and/or rate saturation, discrete (magnitude quantized) control, time delays and actuator dynamics 6 . 
NN training difficulties occur when unmodeled actuator characteristics are encountered. For example, the NN 
adaptive element will attempt to adapt to these nonlinearities, even when it is impossible to do so. The goal of PCH 
is to prevent the adaptive element from attempting to adapt to these characteristics, while not affecting NN 
adaptation to other sources of inversion error. Conceptually, PCH “moves the reference model backwards” by an 
estimate of the amount the controlled system did not move due to selected actuator characteristics (such a position 
and rate limits, time delays, etc). The reference model is hedged according to an estimate of the difference between 
the commanded and actually achieved pseudo-control. 
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The hedge signal is defined as 


V h =v-v 


(22) 


where v is the commanded pseudo control and v is an estimate for the achieved pseudo control. For the design 
approach in Section III, for example, vis defined in Eq. (5) and the estimate is obtained by combining Eq. (3) and 
(4) and replacing the elements of u 2 in Eq. (4) by estimates obtained from actuator models of the form depicted in 
Fig. 5. Thus, 


v h = v -\F(x) + G{x)A(x) + G{x)B(x)-u 2 ] (23) 

The elements of the hedge signal are then subtracted in the reference models for each respective axis. The 
manner in which this is done for a second order reference model is depicted in Fig. 6. 
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Limit*dt 
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Figure 5. Actuator estimator. Figure 6. Reference model with hedging in pitch channel. 


VI. N eur al N etwork Adaptation 

The modeling error A depends on the states and the pseudo control. It has been shown that this error can be 
approximated 5 , in a bounded region, to any desired degree of accuracy using a NN with a sufficient number of 
hidden layer neurons, having the following input vector 

/*(*)= n v l (0 y T d Wi 

where 

Vd{t )=[i'(t) v(t-d) ■■■ v{t-{n x -r-\)d)] T 

tJ(0=[>’(0 yit- d ) ■■■ y{t-(ih-\)d)] T 


(24) 

(25) 


with «i>«, d > d > 0 denoting time delay. 

In the case of a single hidden- layer, multi-perceptron NN shown in Fig. 7, we have 

v ad = W T a(v T fi) ( 26 ) 


where cr is a vector whose elements, <t,{z,), are the basis functions of the NN. Typically, these basis functions are 
selected as so-called squashing functions. The form we employed is a j {z t ) = l/(l + e ~ aM ) , where o, is the activation 
potential. The network weights are updated according to the following adaptation laws: 


v = -r v \ 
w = -r w 


2jue T PbW T a' + AV \ 
2{a-a'V T /j)e T Pb + AW 
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where a = a(V T p) and (t'(x) = diag(dcr i / dz i ) , e is the tracking error vector, P is the positive definite solution to 

the Lyapunov equation A T P + PA = -Q with Q > 0, A is a Hurwitz matrix, and T v and T w are adaptation gains. It 
has been shown that the adaptive laws given in Eq. (27) guarantee that all error signals and network weights are 
uniformly bounded 5 . 


V W 

/ \ / \ 



Figure 7. Single Hidden Layer (SHL) perceptron neural network 
VII. Simulations 

The simulation model was constructed using Matlab/Simulink 1 ' implementing the UAV’s preliminary 
configuration data, mass property and static wind tunnel data which covers angle of attack from -6 to 20 degrees 
and sideslip angle from -16 to 16 degrees, along with the assumed dynamic damping derivatives: C = -1.0, 

Ci p =- 0.25, C,„.=-0.1, C ma = 0. The aircraft trim conditions are: speed V T =31.0 m/s, altitude hf= 122.0 m, trim 

angle of attack <r/ T =-2.8 1 6 degrees, and trim sideslip angle f3j=- 0.541 degrees. The trimmed throttle set is 0.44, and 
it is assumed to be constant during simulations. All aerodynamic control deflections range from -25 to 25 degrees 
with rate limits of ±120 deg/sec. All simulations begin from this trim condition. 

A. Model of Atmospheric Turbulence 

The flight envelope of the UAV involves mostly low altitude where gusts or turbulences are common, hence a 
model of the turbulence needs to be implemented in control simulations. The theory of stochastic processes provides 
a convenient mean for describing atmospheric turbulence accurately enough for most simulations. For simulation 
purposes it would be practical to model atmospheric turbulence as white noise passing through a linear filter as 
shown in Fig. 8, where the relationship between the auto-spectral density of the output signal and the one of the 
input signal is linear 15 . 


White noise 

Linear Filter 

Turbulance 

velocity 


(Dryden) 



Figure 8. Atmospheric turbulence model as a filtered white noise 


Using the Dryden spectra which were developed to approximate the von Karman turbulence spectra by means of 
rational functions, it is possible to model filters generating turbulence velocity components from white noise inputs. 
These filters were implemented in the simulation model using Matlab/Simulink. 


B. Control Design 1: Two-Stage Dynamic Inversion Based Adaptive Control Design 

The control design was carried out assuming that the pilot commands a, // and p s . The roll channel is relative 
degree one (r = 1) with respect to the control, while both the a and // channels are relative degree two. The details of 
this design are written in Ref. 2 and 1 1 . 
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Since r =1 in the roll channel, a 1 st order reference model is employed for that channel, with a time constant of 
0.3. Likewise, 2 nd order reference models are employed in the pitch and directional channels, with a >„= 5 rad/sec and 
4=1.0. The values selected for the NN gains, defined in Eq.(25), and the number of hidden layer neurons, n„, are 
given in Table 1. The activation potentials (a,) were uniformly distributed between 0.1 and 0.5. In addition, the first 
NN basis function was used to provide a bias term (a 0 = 0). 


Table 1. NN parameters for control design 1. 


channel 

r > 

r w 
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n„ 

ni 
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0.3 

0.1 

10 
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0.01 
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1.0 
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0.1 
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0.01 
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0.5 

0.5 

0.1 

10 

5 

0.01 


Angle of Attack Maneuver 

Simulation results are presented in Fig. 9-11 for a 12° angle of attack command with zero sideslip and p s . Figure 
9 presents the a, /? and p s responses with and without adaptation (NN/PCFI) for the command. As shown in Fig. 9(a), 
at time 0, a begins from its trim value and initially keeps this value. Subsequently at 5 seconds, a command of 12° is 
applied. With NN/PCFI, (/-response follows its reference signal without any overshoot, while J3 and p s responses in 
Fig. 9(b) exhibit oscillations with moderate magnitudes. Flowever, the (/-response cannot follow the command 
without NN/PCFI. It can be clearly seen that good tracking is achieved for the vehicle with adaptation. 

Time histories of aerodynamic controls for cases with and without adaptation are depicted in Fig. 10. The NN 
adaptation signal q„/t) and inversion error A(t) for all channels are compared in Fig. 11. This represents a measure 
of the degree that adaptation is able to compensate for the inversion error, so it can be seen that NN is presenting its 
excellent performance all over the simulation period. 

Stability Axis Roll Rate (p s ) Maneuver 

Simulation results for command of p s = 150 °/sec while maintaining trim angle of attack are depicted in Fig. 12- 
14. Figure 12 shows aircraft responses for cases with and without adaptation. It can be seen that good tracking is 
also maintained in this case with adaptation, except for the transient oscillations in roll response at about 8 and 1 1 
seconds. Without adaptation, a-response diverges at the initial phase. 

Figure 13 depicts time histories of aerodynamic controls for cases with and without adaptation. Figure 14 
compares v a d(t) and A(t) for all three channels. It can be seen that NN compensates for the inversion error precisely. 

The simulation results indicate that the UAV has very agile roll maneuverability, and that it can be greatly 
enhanced by the adaptation. 


C. Control Design 2: Command Augmentation based Adaptive Control Design 


1. State Feedback 

The feedback gains K h K 2 , Kj, and K 4 in Eq.(12) are chosen in accordance with the following equations: 

k 2= k *~, K x =T n K 2 , K 3 =r y K 4 

where 

2 m 2 m 

Tn = pSV T C La ’ Ty = pSV T C y p 


(26) 

(27) 


The time constants t, in Eq.(10), for pitch and roll channels, are set to r = 1/2 4®„ , and for yaw channel it is set to 
r = 2/4®„ . The natural frequency)®,,) and damping ratio) A) are set to 5 rad/sec and 2.0, respectively, for 
simulations. The gains A X ,A 2 ,A 3 in Eq.( 15) are chosen as: 


4=25, 4=20, 4=10 
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The values selected for the NN gains and the number of hidden layer neurons, /?„, are given in Table 2. The 
activation potentials (a,) were uniformly distributed between 0.1 and 0.5. In addition, the first NN basis function 
was used to provide a bias term (a 0 = 0). 


Table 2. Neural network parameters for state and output feedback. 


channel 

r v 

r w 

A 

n„ 

ni 

d 

P 

3.0 

3.0 

0.01 

10 

5 

0.01 

Q 

3.0 

5.0 

0.01 

10 

5 

0.01 

R 

3.0 

1.0 

0.01 

10 

5 

0.01 


Normal Acceleration (a„) Maneuver 

Simulation results using state feedback are presented in Fig. 15-19 for a l±0.8g normal acceleration («„) 
command with zero a y and p. At 4 seconds, a command of 0.2 g is applied and next at 7 seconds, 1.8 g is 
commanded. Figure 15 presents normal acceleration («„), lateral acceleration (a,,) and roll rate (p) responses with 
and without adaptation (NN/PCH) for the command. As shown in Fig. 15(a), a n follows its command with little 
overshoots with adaptation. The roll rate (P) and lateral acceleration ( a y ) shown in Fig. 15(b) are maintained very 
close to zero command with adaptation, except for the short periods of transient response. It is noted that, without 
adaptation, normal acceleration, a n shows bigger overshoot, and P and a v responses are not kept zero as commanded 
and they even diverge from commanded zeros along with yaw rate (R) in Fig. 16. These results are highly 
undesirable. Pitch rate (Q) and yaw rate (R), presented in Fig. 16, follow the reference signals with adaptation. 
Angle of attack and sideslip angle are depicted in Fig. 17, and they show difficulty in returning to trim conditions 
without adaptation. Time histories of control deflections are presented in Fig. 18, and it is noted that PCFI is 
activated for elevator at right after 4 and 7 seconds because of the actuator rate limits. The NN adaptation signal 
v a JX) and inversion error A(t) for all channels are compared in Fig. 19. It can be seen that NN output, i/„/(t) is nicely 
canceling out the error A(t) all over the simulation period. 

Roll Rate ( P ) Maneuver 

Simulation results using state feedback for command of P = ±150 °/sec, while maintaining 1.0 g a„ and zero a y , 
are depicted in Fig. 20-24. At 4 seconds, a command of + 150 °/sec is applied and next at 7 seconds, -150 °/sec g is 
commanded. Figure 20(a) shows roll rate response for cases with and without adaptation. It can be seen that good 
tracking is maintained with adaptation, while it shows bigger error without adaptation. Normal and lateral 
accelerations are depicted in Fig. 20(b) and they follow the commanded values, one and zero, respectively. Pitch rate 
(Q) and yaw rate (R) are presented in Fig. 21, where it can be seen that Q and R follow the reference signals even 
without adaptation. Angle of attack (or) and sideslip angle (f:l) are depicted in Fig. 22, and P shows bigger transient 
amplitude with adaptation. Figure 23 shows time histories of control deflections, and the NN adaptation signal v ac {\) 
and inversion error A(t) for all channels are compared in Fig. 12. It can be seen that NN is showing good adaptation. 

2. Output Feedback 

The same parameter settings and NN gains in Table 2 as those for state feedback in previous subsection are used 
for output feedback case, except for using Eq.(16) instead of Eq.(13). As noted in earlier section, it is assumed that a 
and P are not available. 

Normal Acceleration (a„) Maneuver 

Simulation results using output feedback are presented in Fig. 25-29 for a l±0.8g normal acceleration («„) 
command with zero a y and p. Likewise state feedback case, at 4 seconds, a command of 0.2 g is applied and next at 
7 seconds, 1.8 g is commanded. Figure 25 presents a„, a v and p responses with and without adaptation (NN/PCFI) 
for the command. Pitch rate (Q) and yaw rate (R), presented in Fig. 26, follow the reference signals with adaptation. 
Angle of attack and sideslip angle are depicted in Fig. 27. Overall responses for this command are similar to those of 
state feedback case. Time histories of control deflections are presented in Fig. 28, and the NN adaptation signal 
v a J<f) and inversion error At/) for all channels are compared in Fig. 29. It can also be seen that NN output, v a Pi) 
shows good adaptation by nicely canceling out the error At/) all over the simulation period. 
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Roll Rate ( P ) Maneuver 

Simulation results using output feedback for command of P = ±150 °/sec, while maintaining 1.0 g a„ and zero a v , 
are depicted in Fig. 30-34. Likewise state feedback case shown in previous subsection, at 4 seconds, a command of 
+150 °/sec is applied and next at 7 seconds, -150 °/sec g is commanded. Figure 30(a) shows roll rate response for 
cases with and without adaptation. Normal and lateral accelerations are depicted in Fig. 20(b), and pitch rate (Q) and 
yaw rate (R) are presented in Fig. 31. Angle of attack (a) and sideslip angle (/ 3) are depicted in Fig. 32. Like the 
normal acceleration maneuver, the overall responses of this command are also similar to those of state feedback case. 
Figure 33 shows time histories of control deflections, and the NN adaptation signal v a Jf) and inversion error A(t) for 
all channels are compared in Fig. 34. It can be seen that NN is showing good adaptation by canceling out the error 
A(t) in each channel. 


VIII. Conclusion 

Two NN-based adaptive control designs for the FQM-117B UAV are discussed: One with two-stage dynamic 
inversion with state feedback, and the other with a feedback dynamic inversion based on a command augmentation 
system with state and output feedback. The tracking performances of both approaches are greatly improved by the 
NN-based adaptive control design, thereby implying adaptation to modeling error and uncertainties. Pseudo-control 
hedging is implemented to protect the adaptive process during periods of control nonlinearities such as position 
limits and rate limits. 
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Figure 9. Aircraft responses for an cr-command with/without NN adaptation. 
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Figure 10. Aerodynamic control defections 


Figure 11. NN adaptation signal v arf (t) and A(t) 
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Figure 12. Aircraft responses for a p s -command with/without NN adaptation. 
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Figure 15. Aircraft responses for a normal acceleration (a„) command using state feedback 

with/without NN adaptation. 






Figure 16. Pitch Rate, Q and Yaw Rate, R. 


Figure 17. Angle of attack and sideslip angle. 
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Figure 18. Aerodynamic control defections. 


Figure 19. NN adaptation signal v ad (t) and A(t). 
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Figure 20. Aircraft responses for a roll rate (P) command using state feedback 
with/without NN adaptation. 
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Figure 22. Angle of attack and sideslip angle. 
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Figure 23. Aerodynamic control defections. 


Figure 24. NN adaptation signal v ad (t) and A(t). 
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(a)Normal acceleration ( a „ ) 
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Figure 25. Aircraft responses for a normal acceleration (a„) command using output feedback 

with/without NN adaptation. 





Figure 27. Angle of attack and sideslip angle. 
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Figure 28. Aerodynamic control defections. Figure 29. NN adaptation signal v ad (t) and A(t). 




(a) Roll rate (P) (b) Normal acceleration (a„) and lateral acceleration (a y ) 

Figure 30. Aircraft responses for a roll rate (P) command using output feedback 

with/without NN adaptation. 
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Figure 32. Angle of attack and sideslip angle. 
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Figure 33. Aerodynamic control defections. 


Figure 34. NN adaptation signal v ad (t) and A(t). 
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